SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Bisgard-Frantzen, Henrik 
Svendsen, Allan 
Borchert, Torben Vedel 

(ii) TITLE OF INVENTION: AMYLASE VARIANTS 

(iii) NUMBER OF SEQUENCES: 32 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Novo Nordisk of North America, Inc. 

(B) STREET: 405 Lexington Avenue, Suite 6400 

(C) CITY: New York 
(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10174-6401 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/600,656 

(B) FILING DATE: 13-FEB-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Lambiris, Elias J. 

(B) REGISTRATION NUMBER: 33,723 

(C) REFERENCE/DOCKET NUMBER: 4318.204-US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212 867 0123 

(B) TELEFAX: 212 867 0298 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 485 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala
20 25 30

Asn Leu Lys Ser Lys Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Trp
35 40 45

Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

Thr Arg Asn Gln Leu Gln Ala Ala Val Thr Ser Leu Lys Asn Asn Gly
85 90 95

Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

Gly Thr Glu Ile Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn
115 120 125

Gln Glu Thr Ser Gly Glu Tyr Ala Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr
145 150 155 160

His Phe Asp Gly Thr Asp Trp Asp Gln Ser Arg Gln Leu Gln Asn Lys
165 170 175

Ile Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met
195 200 205

Asp His Pro Glu Val Ile His Glu Leu Arg Asn Trp Gly Val Trp Tyr
210 215 220

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr
245 250 255

Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

Gly Ala Ile Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val
275 280 285

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly
290 295 300

Gly Tyr Tyr Asp Met Arg Asn Ile Leu Asn Gly Ser Val Val Gln Lys
305 310 315 320

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

Gly Glu Ala Leu Glu Ser Phe Val Gln Gln Trp Phe Lys Pro Leu Ala
340 345 350

Tyr Ala Leu Val Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Lys Ser
370 375 380

Lys Ile Asp Pro Leu Leu Gln Ala Arg Gln Thr Phe Ala Tyr Gly Thr
385 390 395 400

Gln His Asp Tyr Phe Asp His His Asp Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly
435 440 445

Gln Val Trp Arg Asp Ile Thr Gly Asn Arg Thr Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Val Trp Val Lys Gln
485



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 485 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp His
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser
20 25 30

Asn Leu Arg Asn Arg Gly Ile Thr Ala Ile Trp Ile Pro Pro Ala Trp
35 40 45

Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

Thr Arg Ser Gln Leu Glu Ser Ala Ile His Ala Leu Lys Asn Asn Gly
85 90 95

Val Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125

Gln Glu Ile Ser Gly Asp Tyr Thr Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140

Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr
145 150 155 160

His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Gln Phe Gln Asn Arg
165 170 175

Ile Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met
195 200 205

Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr
210 215 220

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala
245 250 255

Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val
275 280 285

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly
290 295 300

Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gln Lys
305 310 315 320

His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

Gly Glu Ser Leu Glu Ser Phe Val Gln Glu Trp Phe Lys Pro Leu Ala
340 345 350

Tyr Ala Leu Ile Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Ser Val Pro Ala Met Lys Ala
370 375 380

Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Asn Phe Ala Tyr Gly Thr
385 390 395 400

Gln His Asp Tyr Phe Asp His His Asn Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Pro Gly Gly Glu Lys Trp Met Tyr Val Gly Gln Asn Lys Ala Gly
435 440 445

Gln Val Trp His Asp Ile Thr Gly Asn Lys Pro Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Ile Trp Val Lys Arg
485



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 514 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ala Ala Pro Phe Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr Leu
1 5 10 15

Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn
20 25 30

Leu Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys
35 40 45

Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp
50 55 60

Leu Gly Glu Phe Asn Gln Lys Gly Ala Val Arg Thr Lys Tyr Gly Thr
65 70 75 80

Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala Gly Met
85 90 95

Gln Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly
100 105 110

Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gln
115 120 125

Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe Asp Phe
130 135 140

Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His
145 150 155 160

Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg Ile Tyr
165 170 175

Lys Phe Arg Gly Ile Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu
180 185 190

Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His
195 200 205

Pro Glu Val Val Thr Glu Leu Lys Ser Trp Gly Lys Trp Tyr Val Asn
210 215 220

Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys
225 230 235 240

Phe Ser Phe Phe Pro Asp Trp Leu Ser Asp Val Arg Ser Gln Thr Gly
245 250 255

Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile Asn Lys
260 265 270

Leu His Asn Tyr Ile Met Lys Thr Asn Gly Thr Met Ser Leu Phe Asp
275 280 285

Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Thr
290 295 300

Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gln Pro
305 310 315 320

Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gln
325 330 335

Ala Leu Gln Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala
340 345 350

Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp
355 360 365

Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys Ile
370 375 380

Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln His
385 390 395 400

Asp Tyr Leu Asp His Ser Asp Ile Ile Gly Trp Thr Arg Glu Gly Val
405 410 415

Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys Val
435 440 445

Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn Ser
450 455 460

Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp
465 470 475 480

Val Pro Arg Lys Thr Thr Val Ser Thr Ile Ala Trp Ser Ile Thr Thr
485 490 495

Arg Pro Trp Thr Asp Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val
500 505 510

Ala Trp



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1455 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

CATCATAATG GAACAAATGG TACTATGATG CAATATTTCG AATGGTATTT GCCAAATGAC 60

GGGAATCATT GGAACAGGTT GAGGGATGAC GCAGCTAACT TAAAGAGTAA AGGGATAACA 120

GCTGTATGGA TCCCACCTGC ATGGAAGGGG ACTTCCCAGA ATGATGTAGG TTATGGAGCC 180

TATGATTTAT ATGATCTTGG AGAGTTTAAC CAGAAGGGGA CGGTTCGTAC AAAATATGGA 240

ACACGCAACC AGCTACAGGC TGCGGTGACC TCTTTAAAAA ATAACGGCAT TCAGGTATAT 300

GGTGATGTCG TCATGAATCA TAAAGGTGGA GCAGATGGTA CGGAAATTGT AAATGCGGTA 360

GAAGTGAATC GGAGCAACCG AAACCAGGAA ACCTCAGGAG AGTATGCAAT AGAAGCGTGG 420

ACAAAGTTTG ATTTTCCTGG AAGAGGAAAT AACCATTCCA GCTTTAAGTG GCGCTGGTAT 480

CATTTTGATG GGACAGATTG GGATCAGTCA CGCCAGCTTC AAAACAAAAT ATATAAATTC 540

AGGGGAACAG GCAAGGCCTG GGACTGGGAA GTCGATACAG AGAATGGCAA CTATGACTAT 600

CTTATGTATG CAGACGTGGA TATGGATCAC CCAGAAGTAA TACATGAACT TAGAAACTGG 660

GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGATGC AGTGAAACAT 720

ATAAAATATA GCTTTACGAG AGATTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 780

ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA CTATTTGAAT 840

AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA 900

TCTAATAGCG GTGGTTATTA TGATATGAGA AATATTTTAA ATGGTTCTGT GGTGCAAAAA 960

CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 1020

GAATCCTTTG TTCAACAATG GTTTAAACCA CTTGCATATG CATTGGTTCT GACAAGGGAA 1080

CAAGGTTATC CTTCCGTATT TTATGGGGAT TACTACGGTA TCCCAACCCA TGGTGTTCCG 1140

GCTATGAAAT CTAAAATAGA CCCTCTTCTG CAGGCACGTC AAACTTTTGC CTATGGTACG 1200

CAGCATGATT ACTTTGATCA TCATGATATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 1260

CATCCAAATT CAGGCCTTGC CACCATTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 1320

TATGTGGGGA AAAATAAAGC GGGACAAGTT TGGAGAGATA TTACCGGAAA TAGGACAGGC 1380

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 1440

GTTTGGGTGA AGCAA 1455

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1455 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 60

GGGAATCACT GGAATAGATT AAGAGATGAT GCTAGTAATC TAAGAAATAG AGGTATAACC 120

GCTATTTGGA TTCCGCCTGC CTGGAAAGGG ACTTCGCAAA ATGATGTGGG GTATGGAGCC 180

TATGATCTTT ATGATTTAGG GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 240

ACACGTAGTC AATTGGAGTC TGCCATCCAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 300

GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 360

GAGGTGAATC CAAATAACCG GAATCAAGAA ATATCTGGGG ACTACACAAT TGAGGCTTGG 420

ACTAAGTTTG ATTTTCCAGG GAGGGGTAAT ACATACTCAG ACTTTAAATG GCGTTGGTAT 480

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAATCGTAT CTACAAATTC 540

CGAGGTGATG GTAAGGCATG GGATTGGGAA GTAGATTCGG AAAATGGAAA TTATGATTAT 600

TTAATGTATG CAGATGTAGA TATGGATCAT CCGGAGGTAG TAAATGAGCT TAGAAGATGG 660

GGAGAATGGT ATACAAATAC ATTAAATCTT GATGGATTTA GGATCGATGC GGTGAAGCAT 720

ATTAAATATA GCTTTACACG TGATTGGTTG ACCCATGTAA GAAACGCAAC GGGAAAAGAA 780

ATGTTTGCTG TTGCTGAATT TTGGAAAAAT GATTTAGGTG CCTTGGAGAA CTATTTAAAT 840

AAAACAAACT GGAATCATTC TGTCTTTGAT GTCCCCCTTC ATTATAATCT TTATAACGCG 900

TCAAATAGTG GAGGCAACTA TGACATGGCA AAACTTCTTA ATGGAACGGT TGTTCAAAAG 960

CATCCAATGC ATGCCGTAAC TTTTGTGGAT AATCACGATT CTCAACCTGG GGAATCATTA 1020

GAATCATTTG TACAAGAATG GTTTAAGCCA CTTGCTTATG CGCTTATTTT AACAAGAGAA 1080

CAAGGCTATC CCTCTGTCTT CTATGGTGAC TACTATGGAA TTCCAACACA TAGTGTCCCA 1140

GCAATGAAAG CCAAGATTGA TCCAATCTTA GAGGCGCGTC AAAATTTTGC ATATGGAACA 1200

CAACATGATT ATTTTGACCA TCATAATATA ATCGGATGGA CACGTGAAGG AAATACCACG 1260

CATCCCAATT CAGGACTTGC GACTATCATG TCGGATGGGC CAGGGGGAGA GAAATGGATG 1320

TACGTAGGGC AAAATAAAGC AGGTCAAGTT TGGCATGACA TAACTGGAAA TAAACCAGGA 1380

ACAGTTACGA TCAATGCAGA TGGATGGGCT AATTTTTCAG TAAATGGAGG ATCTGTTTCC 1440

ATTTGGGTGA AACGA 1455

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1548 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GCCGCACCGT TTAACGGCAC CATGATGCAG TATTTTGAAT GGTACVTGCC GGATGATGGC 60

ACGTTATGGA CCAAAGTGGC CAATGAAGCC AACAACTTAT CCAGCCTTGG CATCACCGCT 120

CTTTGGCTGC CGCCCGCTTA CAAAGGAACA AGCCGCAGCG ACGTAGGGTA CGGAGTATAC 180

GACTTGTATG ACCTCGGCGA ATTCAATCAA AAAGGGACCG TCCGCACAAA ATACGGAACA 240

AAAGCTCAAT ATCTTCAAGC CATTCAAGCC GCCCACGCCG CTGGAATGCA AGTGTACGCC 300

GATGTCGTGT TCGACCATAA AGGCGGCGCT GACGGCACGG AATGGGTGGA CGCCGTCGAA 360

GTCAATCCGT CCGACCGCAA CCAAGAAATC TCGGGCACCT ATCAAATCCA AGCATGGACG 420

AAATTTGATT TTCCCGGGCG GGGCAACACC TACTCCAGCT TTAAGTGGCG CTGGTACCAT 480

TTTGACGGCG TTGATTGGGA CGAAAGCCGA AAATTGAGCC GCATTTACAA ATTCCGCGGC 540

ATCGGCAAAG CGTGGGATTG GGAAGTAGAC ACGGAAAACG GAAACTATGA CTACTTAATG 600

TATGCCGACC TTGATATGGA TCATCCCGAA GTCGTGACCG AGCTGAAAAA CTGGGGGAAA 660

TGGTATGTCA ACACAACGAA CATTGATGGG TTCCGGCTTG ATGCCGTCAA GCATATTAAG 720

TTCAGTTTTT TTCCTGATTG GTTGTCGTAT GTGCGTTCTC AGACTGGCAA GCCGCTATTT 780

ACCGTCGGGG AATATTGGAG CTATGACATC AACAAGTTGC ACAATTACAT TACGAAAACA 840

GACGGAACGA TGTCTTTGTT TGATGCCCCG TTACACAACA AATTTTATAC CGCTTCCAAA 900

TCAGGGGGCG CATTTGATAT GCGCACGTTA ATGACCAATA CTCTCATGAA AGATCAACCG 960

ACATTGGCCG TCACCTTCGT TGATAATCAT GACACCGAAC CCGGCCAAGC GCTGCAGTCA 1020

TGGGTCGACC CATGGTTCAA ACCGTTGGCT TACGCCTTTA TTCTAACTCG GCAGGAAGGA 1080

TACCCGTGCG TCTTTTATGG TGACTATTAT GGCATTCCAC AATATAACAT TCCTTCGCTG 1140

AAAAGCAAAA TCGATCCGCT CCTCATCGCG CGCAGGGATT ATGCTTACGG AACGCAACAT 1200

GATTATCTTG ATCACTCCGA CATCATCGGG TGGACAAGGG AAGGGGGCAC TGAAAAACCA 1260

GGATCCGGAC TGGCCGCACT GATCACCGAT GGGCCGGGAG GAAGCAAATG GATGTACGTT 1320

GGCAAACAAC ACGCTGGAAA AGTGTTCTAT GACCTTACCG GCAACCGGAG TGACACCGTC 1380

ACCATCAACA GTGATGGATG GGGGGAATTC AAAGTCAATG GCGGTTCGGT TTCGGTTTGG 1440

GTTCCTAGAA AAACGACCGT TTCTACCATC GCTCGGCCGA TCACAACCCG ACCGTGGACT 1500

GGTGAATTCG TCCGTTGGAC CGAACCACGG TTGGTGGCAT GGCCTTGA 1548



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 485 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Asn Ser Asp Ala Ser
20 25 30

Asn Leu Lys Ser Lys Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Trp
35 40 45

Lys Gly Ala Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

Thr Arg Ser Gln Leu Gln Ala Ala Val Thr Ser Leu Lys Asn Asn Gly
85 90 95

Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

Ala Thr Glu Met Val Arg Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125

Gln Glu Val Thr Gly Glu Tyr Thr Ile Glu Ala Trp Thr Arg Phe Asp
130 135 140

Phe Pro Gly Arg Gly Asn Thr His Ser Ser Phe Lys Trp Arg Trp Tyr
145 150 155 160

His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Arg Leu Asn Asn Arg
165 170 175

Ile Tyr Lys Phe Arg Gly His Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Met
195 200 205

Asp His Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr
210 215 220

Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

Ile Lys Tyr Ser Phe Thr Arg Asp Trp Ile Asn His Val Arg Ser Ala
245 250 255

Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

Gly Ala Ile Glu Asn Tyr Leu Gln Lys Thr Asn Trp Asn His Ser Val
275 280 285

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly
290 295 300

Gly Asn Tyr Asp Met Arg Asn Ile Phe Asn Gly Thr Val Val Gln Arg
305 310 315 320

His Pro Ser His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

Glu Glu Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala
340 345 350

Tyr Ala Leu Thr Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Arg Ser
370 375 380

Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Lys Tyr Ala Tyr Gly Lys
385 390 395 400

Gln Asn Asp Tyr Leu Asp His His Asn Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Ala Gly Gly Ser Lys Trp Met Phe Val Gly Arg Asn Lys Ala Gly
435 440 445

Gln Val Trp Ser Asp Ile Thr Gly Asn Arg Thr Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Ile Trp Val Asn Lys
485



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GCTGCGGTGA CCTCTTTAAA AAATAACGGC



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CCACCGCTAT TAGATGCATT GTAC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
CTTACGTATG CAGACGTCGA TATGGATCAC CC 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GATCCATATC GACGTCTGCA TACGTAAGAT AGTC 



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
TTASGGGCAA GGCCTGGGAC TGG



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCCAGGCCTT GCCCSTAAAT TTATATATTT TGTTTTG 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGTTTCGGTT CGAAGGATTC ACTTCTACCG C 



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCGGTAGAAG TGAATCCTTC GAACCGAAAC CAG 



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GGTACTATCG TAACAATGGC CGATTGCTGA CGCTGTTATT TGC 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
CTGTGACTGG TGAGTACTCA ACCAAGTC 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CTACTTCCCA ATCCCAAGCT TTACCTCGGA ATTTG



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
CAAATTCCGA GGTAAAGCTT GGGATTGGGA AGTAG 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
TTGAACAACC GTTCCATTAA GAAG 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
CTCTGTATCG ACTTCCCAGT CCCAAGCTTT TGTCCTGAAT TTATATATTT TGTTTTGAAG 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
CTCTGTATCG ACTTCCCAGT CCCAAGCTTT GCCTCCGAAT TTATATATTT TGTTTTGAAG



(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

ATGTGTAAGC CAATCGCGAG TAAAGCTAAA TTTTATATGT TTCACTGCAT C 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GCACCAAGGT CATTTCGCCA GAATTCAGCC ACTG 



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
TGTCAGAACC AACGCGTATG CACATGGTTT AAACCATTG 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
ACCACCTGGA CCATCGCTGC AGATGGTGGC AAGGCCTGAA TT 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
GGCAAAAGTT TGACGTGCCT CGAGAAGAGG GTCTAT



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
TTGTCCCGCT TTATTCTGGC CAACATACAT CCATTT 



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
CCCAATCCCA AGCTTTACCA YCGAACTTGT AGATACG 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
CCCAATCCCA AGCTTTATCT CSGAACTTGT AGATACG 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
GATCCATATC GACGTCTGCA TACAGTAAAT AATC 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
GATCCATATC GACGTCTGCA TAAATTAAAT AATC